Finding Occurrences of Relevant Functional Elements in Genomic Signatures.
نویسندگان
چکیده
For genomic applications, signature-finding algorithms identify over-represented signatures (words) in collections of DNA sequences. The results can be presented as a specific sequence of bases, a consensus sequence showing possible combination of bases, or a matrix of weighted possibilities at each position. These results are often compared to a biological set of binding sites (i.e., known functional elements), which are usually represented as weighted matrices. The comparison is made by scoring the signatures against each weight matrix to identify the best option for a positive hit. However, this approach can misclassify results when applied to short sequences, which are a frequent result of signature finders. We describe a novel method using a window around the original sequences (those which the signature is based upon) to improve the comparison and identify a more significant measure of similarity. In doing so, our method transforms a list of DNA signatures into a resource of characterized binding sites with known functional roles and identifies novel elements in need of further elucidation.
منابع مشابه
Detection of Genetic Differences between Holstein and Iranian North-West Indigenous Hybrid Cattles using Genomic Data
Extended Abstract Introduction and Objective: Selection to increase the frequency of new mutations useful only in some subpopulations leaves markers at the genome level. Most of these regions are related to genes and QTLs controlling significant economic traits. Material and Methods: In order to detection of genetic differences between Iranian northwestern crossbred and Holstein cattle breed,...
متن کاملWord-Based Characterization of the Bidirectional Promoters from the Human DNA-Repair Pathway
A word-based genomic signature for a group of related genomic sequences is a set of characteristic subsequences. Unlike most existing genomic signatures, a word-based genomic signature provides insights that are directly applicable to the problem of identifying functional DNA elements. The effectiveness of the word-based genomic signature method is shown by analyzing promoter sequences for gene...
متن کاملChromaSig: A Probabilistic Approach to Finding Common Chromatin Signatures in the Human Genome
Computational methods to identify functional genomic elements using genetic information have been very successful in determining gene structure and in identifying a handful of cis-regulatory elements. But the vast majority of regulatory elements have yet to be discovered, and it has become increasingly apparent that their discovery will not come from using genetic information alone. Recently, h...
متن کاملDiscovery and Annotation of Functional Chromatin Signatures in the Human Genome
Transcriptional regulation in human cells is a complex process involving a multitude of regulatory elements encoded by the genome. Recent studies have shown that distinct chromatin signatures mark a variety of functional genomic elements and that subtle variations of these signatures mark elements with different functions. To identify novel chromatin signatures in the human genome, we apply a d...
متن کاملGene structure prediction in syntenic DNA segments.
The accurate prediction of higher eukaryotic gene structures and regulatory elements directly from genomic sequences is an important early step in the understanding of newly assembled contigs and finished genomes. As more new genomes are sequenced, comparative approaches are becoming increasingly practical and valuable for predicting genes and regulatory elements. We demonstrate the effectivene...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- International journal of computational science
دوره 2 5 شماره
صفحات -
تاریخ انتشار 2008